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SHARING OF INTERRUPTS 
BETWEEN OPERATING ENTITIES 

TECHNICAL FIELD 

[0001] Embodiments of the invention relates to support for operating entities such as 
virtual machines or threads. More particularly, the invention relates to techniques for 
sharing of interrupts between operating entities. 

BACKGROUND 

[0002] Computer hardware and operating systems currently exist that support 
multiple virtual machines that can have independent operating systems and that execute 
applications without interaction with other virtual machines. The virtual machines may 
have access to different subsets of devices and/or resources provided by or through the 
computer hardware. Figure 1 is a block diagram of a computer system that supports 
virtual machines. 

[0003] Host machine 100 represents the hardware of the computer system. Host 
machine 100 includes, for example, one or more processors, one or more bus systems, 
mass storage, read-only memory, input/output devices, a power supply, etc. Host 
machine 100 represents the many computer systems known in the art can support virtual 
machines. Host monitor 1 10 provides an interface between the hardware of host machine 
100 and virtual machines 140 and 150. Each virtual machine can have an independent 
operating system. For example, virtual machine 140 runs guest operating system 145 and 
virtual machine 150 runs guest operating system 155. 

[0004] When host machine 100 receives an interrupt signal, host monitor cannot 
determine which virtual machine should process the interrupt. Thus, host monitor 110 
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forwards the interrupt message to all virtual machines. Because generally only one of the 
virtual machines processes the interrupt the remaining virtual machines treat the interrupt 
signal as a spurious interrupt signal. The context switching overhead associated with 
sending an interrupt signal to all virtual machines results in computational inefficiencies. 
These inefficiencies are compounded if one or more of the virtual machines calls an 
interrupt service routine (ISR) chain to process the interrupt signal, which results in 
multiple ISRs being called. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is illustrated by way of example, and not by way of limitation, in 
the figures of the accompanying drawings in which like reference numerals refer to 
similar elements. 

Figure 1 is a block diagram of a computer system that supports virtual machines. 

Figure 2 is a conceptual illustration of interrupt processing using an interrupt 
service routine chain. 

Figure 3 is a conceptual illustration of interrupt processing with multiple virtual 
machines each using an interrupt service routine chain. 

Figure 4 is a conceptual illustration of interrupt processing with multiple virtual 
machines each using an interrupt service routine chain in which interrupt signals are 
selectively passed to one or more virtual machine operating systems. 

Figure 5 is a block diagram of one embodiment of a circuit for selective assertion 
of interrupt signals to virtual machines. 

Figure 6 is a flow diagram of one embodiment of a technique for selective 
assertion of interrupt signals to virtual machines. 
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DETAILED DESCRIPTION 

[0005] In the following description, numerous specific details are set forth. However, 
embodiments of the invention may be practiced without these specific details. In other 
instances, well-known circuits, structures and techniques have not been shown in detail in 
order not to obscure the understanding of this description. 

[0006] In many computer system architectures, devices coupled to an interconnect, 
for example, a Peripheral Component Interconnect (PCI) bus or a PCI Express 
interconnect typically share interrupt signals between multiple devices. The various PCI 
standards are managed by the PCI Special Interest Group of Portland, Oregon. Specific 
PCI standards and versions are discussed in greater detail below. Other, non-PCI 
standard interfaces can also provide shared interrupt signaling. 
[0007] These shared signals utilize level-triggered semantics allowing multiple 
interrupt service routines (ISRs) to be associated with the same interrupt line. When an 
interrupt signal is asserted, an operating system calls all ISRs registered with the 
operating system (the ISR chain) for the given interrupt signal in turn until an ISR for the 
device asserting the signal claims the interrupt and services the interrupting device thus 
de-asserting the interrupt signal. An example of interrupt processing using an ISR chain 
for a single operating system is illustrated in Figure 2. 

[0008] Using the example of Figure 2, device 200 generates an interrupt by asserting 
interrupt signal 205, which is interpreted by operating system (OS) 210. In response to 
interrupt signal 205, OS 210 causes ISR chain invocation 215. Receipt of interrupt 
signals and invocation of ISR chains is known in the art and any appropriate technique 
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can be used. The ISR chain can include any number of ISRs including, for example, ISR 
220, ISR 230 and ISR 240. 

[0009] In the example of Figure 2, it is possible that the operating system may 
receive a spurious interrupt due to a race condition between a device de-asserting an 
interrupt signal and the operating system issuing an end-of-interrupt (EOI) command to 
the interrupt controller. In this case, the operating system invokes the ISR chain and 
none of the ISRs claim the interrupt, which causes the operating system to dismiss the 
interrupt as spurious. 

[0010] Conceptually, use of an ISR chain is computationally inefficient because 
multiple ISRs are called, but only one services the interrupt. In practice, however, in 
many situations, the latency penalty is relatively small and is considered an acceptable 
trade off for reduced design complexity. However, when the concept of an ISR chain is 
applied to a system running multiple virtual machines, the overhead involved with 
executing multiple ISR chains becomes increasingly detrimental to overall system 
performance. 

[0011] Figure 3 is a conceptual illustration of interrupt processing with multiple 
virtual machines each using an interrupt service routine chain. Device 300 asserts 
interrupt signal 310 that is communicated to host monitor 320. Traditionally, 
determining the source of an interrupt signal on a shared interrupt interface is performed 
by software that is aware of the programming interface of the device. In a virtual 
machine environment, the awareness is provided by operating system specific device 
drivers running on the respective virtual machines and thus cannot be utilized by the host 
monitor to determine the source of the interrupt signal. 
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[0012] In the example of Figure 3, when host monitor 320 receives the interrupt 
signal, a virtual machine interrupt signal (e.g., VM1 Interrupt 330, VM2 interrupt 332, 
VM3 interrupt 334) is generated for each virtual machine running on the host machine. 
The virtual machine interrupt signals are interpreted by the respective virtual machine 
operating systems (e.g., Virtual Machine 1 Guest OS 340, Virtual Machine 2 Guest OS 
350, Virtual Machine 3 Guest OS 360) to invoke ISR chains (e.g., ISR chain invocation 
345, ISR chain invocation 355, ISR chain invocation 365) for the respective virtual 
machines. 

[0013] Each ISR chain includes multiple ISRs that are not the ISR required for 
processing the interrupt. For example, assuming an ISR registered with Virtual Machine 
3 Guest OS 360, ISR chain 1 that can include ISRs 370, 372 and 374 as well as additional 
ISRs and ISR chain 2 that can include ISRs 380, 382 and 384 as well as additional ISRs 
treat the interrupt signal as a spurious interrupt. This amplifies the computational 
inefficiencies of the single ISR chain case described above. In addition to the 
inefficiencies of the ISR chains, there is a context switch overhead involved that can 
cause greater computational inefficiencies. 

[0014] As the interrupt is processed by ISR chain 3 that can include ISRs 390, 392 
and 394 as well as additional ISRs, traditional ISR chain processing identifies the proper 
ISR for the interrupt from device 300. As illustrated by the example of Figure 3, the use 
of multiple ISR chains in a virtual machine environment can result in significant resource 
consumption. As the number of virtual machines increases and/or the number of ISRs in 
the ISR chains increases the inefficiencies increase. As described below, some (or even 
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all) of the inefficiencies of the architecture of Figure 3 can be eliminated to provide a 
more efficient interrupt processing technique in a virtual machine environment. 
[0015] Figure 4 is a conceptual illustration of interrupt processing with multiple 
virtual machines each using an interrupt service routine chain in which interrupt signals 
are selectively passed to one or more virtual machine operating systems. While the 
techniques of Figure 4 are described with respect to a virtual machine environment, these 
techniques can be used in alternate environments, for example, in a multi-threaded 
environment. 

[0016] Device 400 asserts interrupt signal 410 that is communicated to host monitor 
420. Interrupt signal 410 is communicated to host monitor, at least in part, over a shared 
interface, for example, a bus conforming to certain versions of the PCI standards. When 
host monitor 420 responds to an interrupt signal assertion, an interrupt status bit 
corresponding to each of the devices sharing the interrupt signal are checked to determine 
the source of the interrupt signal. The PCI Local Bus Specification, Revision 2.3 
published March 29, 2002 as well as the PCI Express Base Specification, Version 1.0 
published July 23, 2002 and Version 1.0a published October 7, 2003 define an interrupt 
status bit in the PCI Status register of devices conforming to these specifications. For 
example, the interrupt status bit is defined as bit 3 of the device status register in Section 
6.2 PCI Local Bus Specification, Revision 2.3. Subsequent standards may also define the 
interrupt status bit or provide another interrupt status indication that can be used as 
described herein. 

[0017] Host monitor 420 can then use the identity of the device to determine which 
virtual machines are associated with the identified device. Host monitor then selectively 



42P 17830 



-8- 



Express Mail No. EV 325532153 US 



generates (or passes) interrupt signals only for (or to) the virtual machines that are 
associated with the device generating the interrupt. This is illustrated in Figure 4 by 
Selective VM1 interrupt 430, Selective VM2 interrupt 432 and Selective VM3 interrupt 
434. 

[0018] The selective interrupt signals are only generated for the virtual machines 
associated with the device generating the interrupt. In one embodiment, the selective 
interrupt signals include an identifier corresponding to the device asserting the interrupt 
signal. In alternate embodiments, the selective interrupt signals do not include the device 
identifier. The selective interrupt signals can be a version of interrupt signal 410 that is 
selectively passed to selected virtual machines. That is, interrupt signal 410 can be 
blocked from virtual machines that do not have access to the device asserting interrupt 
signal 410. 

[0019] The interrupt status bit in the PCI Status register is described in the PCI 
standards as a building block type of functionality without any specific usage. Thus, the 
PCI standards do not define use of the interrupt status bit as described herein. 
[0020] The technique of Figure 4 avoids unnecessary context switch performance 
penalties for virtual machines that would otherwise have processed the interrupt as a 
spurious interrupt signal. Note that there is no possibility for loss of interrupts due to the 
level triggered nature of the interrupt signal. Host monitor 420 can share level triggered 
interrupts between devices assigned to separate guest virtual machines without incurring 
context switch penalties for virtual machines that are not associated with a device 
asserting the interrupt signal. 
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[0021] Figure 5 is a block diagram of one embodiment of a circuit for selective 
assertion of interrupt signals to virtual machines. In one embodiment, the circuit of 
Figure 5 is implemented for each device having the ability to assert an interrupt signal via 
a shared interrupt interface (e.g., a PCI bus). Variations of the circuit of Figure 5 can be 
used to achieve the same or a similar result. 

[0022] Device-specific interrupt status indicator 500 provides an indication of 
whether an associated device has asserted a shared interrupt signal. For example, 
interrupt status indicator 500 can be a register having one or more bits indicating interrupt 
status of one or more devices. Device-specific interrupt enable indicator 510 operates to 
enable checking of device-specific interrupt status indicator 500 as both indicators are 
coupled to provide input signals to logic gate 520. In one embodiment, logic gate 520 is 
an AND gate. Alternate embodiments, of determining whether a specific device has 
asserted an interrupt signal on a shared interrupt interface can also be used. 
[0023] The output of logic gate 520 is the INTx Status signal, which indicates 
whether the corresponding device is asserting the interrupt signal. The INTx Status 
signal also indicates whether the device is asserting the interrupt signal when the INTx 
signal is disabled by the INTx Disable signal. In one embodiment, the INTx Disable 
signal and the INTx Status signal are input signals to logic gate 530, which generates the 
INTx signal. In one embodiment, logic gate 530 is an AND gate. 
[0024] The INTx signal is the interrupt signal that is propagated over the shared 
interrupt interface (e.g., a bus conforming to PCI 2.3 or PCI Express). By monitoring the 
INTx signals and the corresponding INTx Status signals, the host monitor can determine 
the device that has asserted the interrupt signal. 
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[0025] Figure 6 is a flow diagram of one embodiment of a technique for selective 
assertion of interrupt signals to virtual machines. An interrupt signal is received via a 
shared interface, 610. As mentioned above, the shared interface can be, for example, a 
PCI bus, or a bus conforming to a different standard that allows multiple devices to share 
an interrupt signal. 

[0026] The device that asserted the interrupt signal is identified, 620. The device 
asserting the interrupt signal can be identified, for example, via a register having an 
interrupt status bit as in the PCI Express standard, or via a non-standard register. In one 
embodiment, a host monitor determines the device that asserted the interrupt signal and 
also determines the virtual machines (or, alternatively, the threads) having access to the 
device that asserted the interrupt signal. 

[0027] An interrupt indication is forwarded to the virtual machines (or threads) that 
are registered to use the device that asserted the interrupt signal, 630. In one 
embodiment, the interrupt indication includes an identifier of the device. In alternate 
embodiments, the interrupt indication does not include the device identifier, but is a 
forwarded version of the originally asserted interrupt signal. 

[0028] The virtual machines that receive the interrupt indication invoke one or more 
interrupt service routines to process the interrupt, 640. In one embodiment, each virtual 
machine invokes an interrupt service routine chain to process the interrupt. The interrupt 
is processed by one of the interrupt service routines invoked by one of the virtual 
machines, 650. 

[0029] Reference in the specification to "one embodiment" or "an embodiment" 
means that a particular feature, structure, or characteristic described in connection with 
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the embodiment is included in at least one embodiment of the invention. The 
appearances of the phrase "in one embodiment" in various places in the specification are 
not necessarily all referring to the same embodiment. 

[0030] While the invention has been described in terms of several embodiments, 
those skilled in the art will recognize that the invention is not limited to the embodiments 
described, but can be practiced with modification and alteration within the spirit and 
scope of the appended claims. The description is thus to be regarded as illustrative 
instead of limiting. 



42P17830 



-12- 



Express Mail No. EV 325532153 US 



